Measures of spread are ways of summarizing a group of data by describing how spread out the values are. If the spread of values in the data set is large, the mean is not as representative of the data as if the spread of data is small. This is because a large spread indicates that there are probably large differences between individual data points.
The spread of the values can be measured for quantitative data, as the variables are numeric and can be arranged into a logical order with a low end value and a high end value. Measures of spread are used in conjunction with a measure of central tendency, such as the mean or median, to provide an overall description of a set of data. To describe spread, a number of statistics are available, including the range, quartiles and standard deviation. Which you use will depend on how much and the type of data you collected.
What Measure of Central Tendency Did you Calculate? | |
Mean with 5 or more trials per level of manipulation | Standard Deviation |
Mean with 4 or less trials per level of manipulation | Range |
Median with 5 or more trials per level of manipulation | Quartile |
Median with 4 or less trials per level of manipulation | Range |
Averages do not tell us everything about a sample. Samples can be very uniform with the data all bunched around the mean (Figure 1) or they can be spread out a long way from the mean (Figure 2). The statistic that measures this spread for normally distributed data is called the standard deviation. The wider the spread of scores, the larger the standard deviation.
For data that has a normal distribution, 68% of the data lies within one standard deviation of the mean.
Calculating the Standard Deviation in Sheets
| Using Excel to Calculate the Standard Deviation
|
Quartiles divide an ordered dataset into four equal parts, and refer to the values of the point between the quarters. Quartiles are a useful measure of spread because they are much less affected by outliers or a skewed data set than the standard deviation. For this reason, quartiles are often reported along with the median as the best choice of measure of spread and central tendency, respectively, when dealing with skewed and/or data with outliers.
A common way of expressing quartiles is as an interquartile range. The interquartile range (IQR) describes the difference between the third quartile (Q3) and the first quartile (Q1), telling us about the range of the middle half of the scores in the distribution. The IQR is often seen as a better measure of spread than the range as it is not affected by outliers
Range is the difference between the smallest value and the largest value in a dataset. Range is used if there are less than 5 trials that are being used to calculate a measure of central tendency..